K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 76 | 94 | 99 | 99 | 99 |
1000 | 324 | 686 | 887 | 955 | 980 |
10000 | 788 | 2952 | 5574 | 7430 | 8579 |
100000 | 2008 | 10130 | 27306 | 47805 | 64967 |
1000000 | 2181 | 10435 | 27797 | 48540 | 66000 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings